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The newly emergent Middle East respiratory syndrome coronavirus 
(MERS-CoV) can cause severe pulmonary disease in humans”, repre- 
senting the second example of a highly pathogenic coronavirus, the 
first being SARS-CoV’. CD26 (also known as dipeptidyl peptidase 
4, DPP4) was recently identified as the cellular receptor for MERS- 
CoV’. The engagement of the MERS-CoV spike protein with CD26 
mediates viral attachment to host cells and virus—cell fusion, thereby 
initiating infection. Here we delineate the molecular basis of this 
specific interaction by presenting the first crystal structures of both 
the free receptor binding domain (RBD) of the MERS-CoV spike 
protein and its complex with CD26. Furthermore, binding between 
the RBD and CD26 is measured using real-time surface plasmon 
resonance with a dissociation constant of 16.7 nM. The viral RBD 
is composed of a core subdomain homologous to that of the SARS- 
CoV spike protein, and a unique strand-dominated external receptor 
binding motif that recognizes blades IV and V of the CD26 [-propeller. 
The atomic details at the interface between the two binding entities 
reveal a surprising protein-protein contact mediated mainly by 
hydrophilic residues. Sequence alignment indicates, among beta- 
coronaviruses, a possible structural conservation for the region 
homologous to the MERS-CoV RBD core, but a high variation in 
the external receptor binding motif region for virus-specific patho- 
genesis such as receptor recognition. 

The recent identification of a novel coronavirus, MERS-CoV— 
which, as of May 15th 2013, had infected 40 patients with a total of 
20 fatalities—has drawn worldwide attention as a potential cause of a 
future pandemic’. Unlike most coronaviruses circulating in humans 
that only cause mild respiratory illness’, MERS-CoV possibly repre- 
sents a second reported coronavirus of severely high virulence after 
SARS-CoV, which caused over 8,000 infection cases globally in 2003, 
with more than 800 deaths’. The clinical manifestations of MERS-CoV 
infection include fever, cough, acute respiratory distress syndrome 
and, in some cases, accompanying renal failure'’, and are very similar 
to those caused by SARS-CoV. However, the novel coronavirus diverges 
from SARS-CoV in genomic sequence, and is much more closely related 
to the bat-derived HKU4 and HKU5 coronaviruses”*. Consistent with 
phylogenetic analysis, MERS-CoV does not use the SARS-CoV receptor, 
angiotensin converting enzyme 2 (ACE2), as its entry receptor’; rather, a 
recent study showed that it uses human CD26 for this purpose*. CD26 is 
the third peptidase to be identified as a functional coronavirus receptor, 
the others being aminopeptidase N (ANPEP, also known as APN and 
CD13)'°" and ACE2 (ref. 12). 

The recognition of CD26 by MERS-CoV is mediated by virus surface 
spike (S) protein*. As with other coronaviruses, the MERS-CoV S pro- 
tein would be cleaved in host cells into $1 and $2 subunits (Fig. la). $1 
engages the receptor* whereas S2, with typical sequence motifs homo- 
logous to those identified as the heptad repeats in class I enveloped 
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viruses'*'°, should mediate membrane fusion. The exploitation of the 


virus—-receptor interaction and thus of the intervention strategies 
requires an atomic delineation of the receptor-binding properties of 
S1. On the basis of previous studies, the receptor attachment sites of 
coronavirus S1 subunits might locate to either the amino-terminal 
(such as in murine hepatitis virus’®) or the carboxy-terminal (such as 
in, for example, SARS-CoV”’ and human coronavirus NL63 (ref. 18)) 
domain. We therefore tested individually the binding of MERS-CoV S1 
and its N- and C-terminal-domain proteins to cell-surface-expressed 
CD26 molecules. The receptor-binding capacity was attributed to the 
C-terminal amino acids 367-606 of MERS-CoV S1 (Fig. 1b). We hereby 
referred to this domain as RBD. The potent interaction between MERS- 
CoV RBD and CD26 was further demonstrated by surface plasmon 
resonance assays, in which CD26 binds to MERS-CoV RBD with a 
dissociation constant (Kj) of about 16.7 nM (K.,, 1.79 X 10° M's _'; 
Kopp 2.99 X 10° °s'), but does not bind to the RBD of SARS-CoV 
(Fig. 1c). 

We crystallized MERS-CoV RBD and solved its structure at a reso- 
lution of 2.5 A (Supplementary Table 1). Two molecules of essentially 
the same structure are present in the asymmetric unit. Each molecule 
contains 208 consecutive density-traceable amino acids from V381 to 
L588. A Dali’? search within the Protein Data Bank (PDB) revealed 
clear structural homology between MERS-CoV RBD and SARS-CoV 
RBD (PDB code, 2DD8; Z score, 15.1). We therefore divided the MERS- 
CoV RBD structure into two subdomains: a core and an external 
B-sheet, using the structure of SARS-CoV RBD as a reference. The core 
subdomain reveals a five-stranded antiparallel $-sheet (B1, B3, B4, B5 
and $10) in the centre. The connecting helices (four «-helices: 71-4 and 
two 3, -helices: n1 and 12) and two small B-strands ($2 and B11) 
further decorate the sheet on both sides, together forming a globular 
fold. Three disulphide bonds, connecting C383 to C407, C425 to C478, 
and C437 to C585, respectively, stabilize the core-domain structure 
from the interior. At the solvent-exposed side, the RBD termini are 
clinched adjacent to each other (Fig. 2a, b). This subdomain fold is very 
similar to that of the SARS-CoV RBD core (a root mean squared devi- 
ation of 2.79 A for 76 Co pairs). Superimposition of the two structures 
reveals a well-aligned centre sheet and homologous peripheral helices 
and strands, although several intervening loops are observed to exhibit 
large conformational variance (Fig. 2c). 

The external subdomain of MERS-CoV RBD is mainly a B-sheet 
structure with three large (B6, B8 and 89) and one small (87) strand 
arranged in an antiparallel manner. It is anchored to the RBD core 
through the 85/6, B7/8 and B9/10 intervening loops, which touch the 
core subdomain like a clamp at both the top and bottom positions. 
Two small 319 helices (n3 and 14) and most of the connecting loops in 
this subdomain locate on the interior side of the sheet, hence exposing 
a flat exterior sheet-face to the solvent. Residues C503 and C526 form 
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Figure 1 | Identification of the MERS-CoV RBD. a, A schematic 
representation of the MERS-CoV S protein. The N-terminal domain (NTD) 
and RBD are defined on the basis of a pairwise sequence alignment with the 
N-terminal galectin-like domain of murine hepatitis virus S and the RBD of 
SARS-CoV S, respectively. The remaining domain elements are 
bioinformatically defined on the basis of the web-server predictions (signal 
peptide (SP), SignalP 4.0 server; transmembrane domain (TM), TMHMM 
server; heptad repeats 1 and 2 (HR1 and HR2), Learncoil-VMF program). 

? denotes the presumed/estimated S1/S2 cleavage site. A previous prediction* 
indicates cleavage between R751 and S752 with a 602-residue $2. However, a 
recent study”® revealed a spike C-terminal domain (possibly $2) of ~100 kDa, 
indicating a cleavage site upstream of R751/S752. b, A flow cytometric assay of 


the fourth disulphide bond, linking the n3 helix to strand B6 (Fig. 2a, b). 
With no observable structure homology (Fig. 2c), the external subdo- 
mains of MERS-CoV and SARS-CoV RBDs are topological equivalents, 
both being present as an ‘insertion’ between the equivalent core-strands 
(strands 85 and B10 in MERS-CoV, and £6 and B9 in SARS-CoV) 
(Supplementary Fig. 1). 

To elucidate the structural basis of the virus-receptor engagement, 
we further prepared the RBD-CD26 complex by in vitro mixture of the 
two proteins and then purification on a gel filtration column. Con- 
sistent with the high binding affinity between MERS-CoV RBD and 
CD26, the complex is easily obtainable and stable (Supplementary Fig. 2). 
The complex structure was solved at 2.7 A resolution (Supplementary 
Table 1) with one RBD binding to a single CD26 molecule in the asy- 
mmetric unit. The receptor, as shown in previous reports**”’, is com- 
posed of an eight-bladed B-propeller domain and an «/B hydrolase 
domain. MERS-CoV RBD binds to the side-surface of the CD26 B- 
propeller, recognizing blades IV and V and a small bulged helix in the 
blade-linker. As for the viral ligand, the entire receptor binding site 
locates in the external subdomain and to the solvent-exposed sheet- 
face, qualifying the subdomain as the receptor binding motif (RBM) 
(Fig. 3a). Overall, engagement of the receptor does not induce obvious 
conformational changes in RBM, although small structural variance 
could be observed for the tip-loops. The n2-«4 loop in the RBD core, 
however, unexpectedly exhibits a large conformational difference 
between the free and the bounded structures (Supplementary Fig. 3). 
We believe this is due to a crystal contact present in the free RBD struc- 
ture, which is interrupted in the complex crystal by the engaging receptor. 
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the Fc-fused S protein or its subdomain proteins involved in CD26 binding. 
Mock-transfected baby hamster kidney (BHK) cells or BHK cells transfected 
with CD26-expressing plasmid (BHK-CD26) were tested with the individual 
Fc-fusion proteins or an anti-CD26 antibody (anti-CD26 IgG). For each test, 
the secondary antibodies (anti-goat IgG or anti-mouse IgG) were used as the 
negative control. The profiles are shown. From left to right: BHK cells with the 
indicated Fc-fusion proteins or antibodies, BHK-CD26 with anti-CD26 
antibody, BHK-CD26 with Fc-fused $1, BHK-CD26 with Fc-fused NTD, BHK- 
CD26 with Fc-fused RBD. c, A surface plasmon resonance assay characterizing the 
specific binding between CD26 and MERS-CoV RBD. The profiles are shown. 
Left, human ACE2 to SARS-CoV RBD; middle, CD26 to MERS-CoV RBD; top 
right, CD26 to SARS-CoV RBD; bottom right, human ACE2 to MERS-CoV RBD. 


CD26 is a type II transmembrane protein. It is present as a homo- 
dimer on the cell surface*”*’. The dimerization of the peptidase relies 
on broad intermolecule contacts contributed by the hydrolase domain 
and the extended strands in blade IV of the B-propeller*’”’. A lateral 
binding of MERS-CoV RBD to CD26 would therefore not disrupt 
CD26 dimerization. Accordingly, a similar U-shaped CD26 dimer 
could be generated by symmetry operations of the complex structure. 
The viral ligand locates at the membrane-distal tip of the dimer, cor- 
responding well to a trans interaction between the virus and the recep- 
tor (Fig. 3b). Considering that the RBD N and C termini are on the 
same side distant from CD26, it is unlikely that the remaining S 
domains would contact the receptor molecule. The binding mode 
revealed by the complex structure is also in good accordance with a 
previous study showing that the virus—-receptor interaction is inde- 
pendent of the peptidase activity of CD26 (ref. 4). The bound RBD 
is far away from interfering with either the substrate/product accessing 
tunnels or the catalytic centre*’”’ (Fig. 3b). 

Overall, a surface area of 1203.4 and 1113.4 A? in CD26 and MERS- 
CoV RBD, respectively, is buried by complex formation (Fig. 4a). 
Scrutinization of the binding interface reveals a group of hydrophilic 
residues at the site, forming a polar-contact (H-bond and salt-bridge) 
network. These interactions are predominantly mediated by the res- 
idue side chains (including RBD Y499 with CD26 R336, N501 with 
Q286, K502 with T288, D510 with R317, E513 with Q344, and D539 
with K267), although CD26 L294 and RBD D510 are observed to 
contact RBD R542 and CD26 Y322, respectively, through the main- 
chain oxygen atom (Fig. 4b). In addition, the bulged helix in CD26 
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Figure 2 | The overall structure of MERS-CoV RBD. a, A cartoon 
representation of the RBD structure. The secondary structural elements are 
labelled according to their occurrence in sequence. The disulphide bonds 
(marked with Arabic numbers 1-4) and N-glycan linked to N410 are shown as 
orange and green sticks, respectively. Core subdomain, magenta; external 
subdomain, cyan. The N and C termini are labelled. b, An amino acid sequence 
alignment between MERS-CoV and SARS-CoV RBDs. The hollow boxes and 


properly positions three hydrophobic residues A291, L294 and 1295 
into close proximity with the RBD amino acids Y540, W553 and V555, 
forming a hydrophobic centre at the interface (Fig. 4c). Further virus 
receptor contacts include V341 and [346 of CD26 packing against 
P515 and the apolar carbon atoms of R511 and E513 in RBD 
(Fig. 4d), and a CD26 N229-linked carbohydrate moiety interacting 
with RBD amino acids W535 and E536 (Fig. 4e). Overall, the virus- 
receptor engagement is dominated by the polar contacts mediated by 
the hydrophilic residues, and mutations of those in RBD (six alanine 
substitutions and one Y499F mutation of the CD26-interacting amino 
acids) completely abrogated its interaction with CD26 (Supplementary 
Fig. 4). The features of these residue interactions are very similar to 
those mediating the interaction between adenosine deaminase (ADA) 
and CD26 (ref. 23). By a pairwise comparison, we unexpectedly found 
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arrows indicate «/319 helices and B-strands, respectively, and are coloured as in 
a. To facilitate comparison, the secondary-structure elements of SARS-CoV 
RBD (PDB code, 2DD8) are marked with spiral (helices) and arrow (strands) 
lines below the sequence. The cysteine residues that form disulphide bonds are 
labelled as in a, and residue N410 with a star. c, A structural alignment between 
MERS-CoV (magenta for core and cyan for external subdomains) and SARS- 
CoV (green) RBDs. 


that all those CD26 residues identified in the virus—receptor interface 
are also involved in ADA binding, indicating a competition between 
ADA and the virus for CD26 receptor. As the ADA-CD26 interaction 
is shown to induce co-stimulatory signals in T cells*’, this may indicate 
a possible manipulation of the host immune system by MERS-CoV 
through competition for the ADA-recognition site. It is also note- 
worthy that those CD26 residues involved in RBD binding are highly 
conserved between human and bat, with only two variations (I295T 
and R317Q), explaining the capability of MERS-CoV using bat CD26 
for cell entry* (Supplementary Fig. 5). 

Coronaviruses can be categorized into three main genera or groups 
(group 1 (alpha), group 2 (beta) and group 3 (gamma) coronaviruses). 
Both MERS-CoV and SARS-CoV belong to the betacoronavirus genus, 
but are classified into different lineage subgroups (subgroup 2b for 


Figure 3 | The complex structure of MERS-CoV RBD bound to CD26. a, A 
cartoon representation of the complex structure. For clarity, only the 
B-propeller domain of CD26 (grey) is shown. Blades IV, V and the intervening 
IV/V linker that recognize RBD are highlighted in green, blue and red, 
respectively. The core subdomain and external RBM are coloured magenta and 
cyan, respectively. The right panel is yielded by clockwise rotation of the left 
panel along a longitudinal axis in the page-face. b, A symmetry-related CD26 


dimer observed in the complex crystal. The two-fold axis is shown as an upright 
arrow. The transmembrane topology of CD26 is indicated with a modelled 
lipid-bilayer membrane. In CD26, the propeller and side openings indicated as 
the substrate entrance/exit tunnels are marked with arrows, and the catalytic 
triad residues are highlighted as spheres. Colour selections are the same as in 
a, and the CD26 «/B hydrolase domain is shown in orange. The N and C 
termini are labelled. 
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Figure 4 | The atomic interaction details at the binding interface. a, An 
overview of the binding interface. CD26 and RBD are shown in surface and 
cartoon representations, respectively, and are coloured as in Fig. 3. The 
carbohydrate moiety linked to CD26 N229 is shown as green sticks. The 
contacting sites (each allocated with an Arabic number 1-4) are further 


SARS-CoV and 2c for MERS-CoV)*. We noted that the spike sequences 
are of low identity among different subgroup members. For example, 
MERS-CoV and SARS-CoV S proteins show a sequence identity of less 
than 28%. Nevertheless, RBDs of the two coronaviruses are homolog- 
ous for the core subdomain. Notably, the three interior disulphide 
bonds in the core are well-aligned for the steric positions in the two 
RBD structures and well-conserved in sequence among betacorona- 
viruses. Conversely, the external RBM region is highly variable in both 
length and residue composition (Supplementary Fig. 6). Consistently, 
no structural homology in this subdomain is observed between MERS- 
CoV and SARS-CoV. Yet it is this subdomain that engages cellular 
receptors. We therefore assume that betacoronaviruses probably have 
a similar core-domain fold in the S protein to present the external 
amino acids with divergent structures for viral pathogenesis, such as 
receptor recognition. 

Our work presents the fifth structure of virus S protein-receptor 
complexes in the Coronaviridae family'® '**’. Taking into account both 
the RBD structure and the binding mode with receptors, MERS-CoV is 
related to SARS-CoV” (a single insertion functioning as RBM) but 
differs from porcine respiratory coronavirus” and NL63 (ref. 18) of 
alphacoronaviruses (multiple discontinuous RBMs) (Supplementary 
Fig. 7). Nevertheless, related structural topologies can still be observed 
in RBDs of these coronaviruses”. We noted that in the RBD-receptor 
complex structures of both MERS-CoV and porcine respiratory cor- 
onavirus the binding interfaces involve a receptor N-glycan. This 
might represent another cross-genus similarity in the Coronaviridae 
family, which supports a proposed common evolutionary origin of 
coronavirus S proteins”. It would therefore be interesting to investigate 
the contribution of the sugar moiety to the virus-receptor interaction 
for MERS-CoV in the future. 

Vaccination remains the most useful measure to combat viral infec- 
tion and transmission. A large number of antibodies show neutralization 
activity by targeting the RBD and thereby disrupting the virus—receptor 
engagement. Therefore, a properly folded RBD could be an ideal immu- 
nogen for vaccination, as demonstrated for SARS-CoV”. A recent report 
indeed shows the presence of S-specific neutralizing antibodies in 
MERS-CoV-infected patients”*. It may be worth attempting to test the 
immunization effect of MERS-CoV RBD in the future. 


METHODS SUMMARY 

Protein expression, purification, crystallization and structure determination. 
Both His-tagged CD26 and MERS-CoV RBD proteins were expressed in insect 
High Five cells using the Bac-to-Bac baculovirus expression system (Invitrogen). 
The recombinant proteins were then purified via nickel-chelated affinity chro- 
matography and gel filtration. Crystals were obtained by initial screening with the 
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delineated in b-e for the amino acid interaction details. b, A strong polar- 
contact (H-bond and salt-bridge) network. c, d, The small patches of 
hydrophobic interactions. e, Contribution of the carbohydrate moiety. The 
residues involved are shown and labelled. NAG, N-acetyl-D-glucosamine; 
BMA, beta-D-mannose. 


commercially available kits followed by optimization. The RBD structure was 
solved by single-wavelength anomalous diffraction and the complex structure 
by molecular replacement. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Protein expression and purification. The proteins used for crystallization and 
surface plasmon resonance experiments were prepared with the Bac-to-Bac bacu- 
lovirus expression system (Invitrogen). The coding sequences for MERS-CoV 
RBD (GenBank accession number JX869059, spike residues 367-606), SARS- 
CoV RBD (accession number NC_004718, spike residues 306-527), human 
CD26 (accession number NP_001926, residues 39-766) and human ACE2 (acces- 
sion number BAJ21180, residues 19-615) were individually cloned into the 
pFastBacl vector. For each construct, a previously described gp67 signal peptide 
sequence’ was added to the protein N terminus for protein secretion, and a hexa- 
His tag was added to the C terminus to facilitate further purification processes. 
Transfection and virus amplification were conducted with Sf9 cells, and the 
recombinant proteins were produced in High Five cells. The cell culture was 
collected 48h after infection and passed through a 5-ml HisTrap HP column 
(GE Healthcare). After removal of most of the impurities, the recovered proteins 
were then pooled and further purified on a Superdex 200 column (GE Healthcare). 
Finally, each collected protein was prepared in a buffer consisting of 20 mM Tris- 
HCl (pH 8.0) and 150 mM NaCland concentrated to about 10 mg ml * for further 
use. 

To obtain the complex of MERS-CoV RBD bound to CD26, the individual 
proteins were in vitro mixed at a molar ratio of 1:1 and incubated at 4 °C for about 
2h. The complex was then further purified on a Superdex 200 column, and con- 
centrated to about 15mg ml * for crystallization experiments. 

To prepare the Fc chimaeric proteins, the fragment encoding MERS-CoV S1 
(residues 1-751) or NTD (residues 1-353) or RBD (adding the S residues 1-17 of 
the signal peptide to its N terminus to facilitate protein secretion) was fused 5’- 
terminally to a fragment coding for the Fc domain of mouse IgG and ligated into 
the pCAGGS expression vector. A mutant RBD-Fc protein-expressing plasmid 
was also constructed by site-directed mutagenesis, for which the identified hydro- 
philic residues involved in CD26 binding were mutated simultaneously (Y499F; 
NS501A, K502A, D510A, E513A, D539A and R542A). The expression plasmids 
were then transfected into HEK293T cells. The cell culture was collected 48 h after 
transfection and directly used in the flow cytometric assay. 

Analytical gel filtration. MERS-CoV RBD, CD26 and their protein complex were 
individually prepared and adjusted to the same volume. The samples were then 
loaded onto a calibrated Superdex 200 column (GE Healthcare). The chromato- 
graphs were recorded and overlaid onto each other. The pooled proteins were 
analysed on a 12% SDS-PAGE gel and stained with Coomassie blue. 

Surface plasmon resonance assay. The BiAcore experiments were carried out at 
room temperature (25 °C) using a BIAcore 3000 machine with CM5 chips (GE 
Healthcare). For all the measurements, an HBS-EP buffer consisting of 10 mM 
HEPES, pH 7.5, 150 mM NaCl, 3 mM EDTA and 0.005% (v/v) Tween-20 was used, 
and all proteins were exchanged to the same buffer in advance via gel filtration. The 
MERS-CoV RBD and SARS-CoV RBD proteins were immobilized on the chip at 
about 500 response units. Gradient concentrations of human CD26 (0,5, 10, 20, 40, 
80, 160, 320, 640 and 1,280nM) or human ACE2 (0, 10, 20, 40, 80, 160, 320, 640 
and 1,280 nM) were then used to flow over the chip surface. After each cycle, the 
sensor surface was regenerated via a short treatment using 10 mM NaOH. The 
binding kinetics were analysed with the software BIAevaluation Version 4.1 using 
the 1:1 Langmuir binding model. 

Flow cytometric assay. For the surface expression of CD26, the full-length coding 
sequence was cloned into the pEGFP-C1 vector which yields a plasmid encoding a 
recombinant CD26 protein with an EGFP-tag fused to its N terminus. The plasmid 
was transfected into the CD26-negative BHK cells using lipo2000 (Invitrogen) 
according to the manufacturer’s instructions. The cells were collected 48h after 
transfection. 

For staining, the mock-transfected BHK cells or the cells transfected with the 
CD26-expressing plasmid were suspended in PBS and incubated with the indi- 
vidual Fc-fusion protein culture or goat anti-CD26 IgG (R&D Systems) at room 
temperature for 1h. The cells were then washed and further incubated at room 
temperature for about 0.5 h with anti-mouse or anti-goat secondary IgG antibodies 
(R&D Systems). After washing, the cells were analysed by flow cytometry with a 


BD FACSCalibur machine. The cells incubated only with the secondary antibodies 
were used as the negative controls. 

Crystallization. All the crystals were obtained by vapour-diffusion sitting-drop 
method with 1 pl protein mixing with 1 tl reservoir solution and then equilibrat- 
ing against 100 ul reservoir solution at 18 °C. The initial crystallization screenings 
were carried out using the commercially available kits. The conditions that yield 
crystals were then optimized. Diffractable crystals of the free RBD protein were 
finally obtained in a condition consisting of 0.1 M ammonium tartrate dibasic, 
pH7.0, and 12% PEG 3,350 with a protein concentration of 10 mg ml *. Deriva- 
tive crystals were obtained by soaking RBD crystals for 24h in mother liquor 
containing 2mM KAuCl,*2H,0O. The complex crystals were grown in 6% (v/v) 
2-propanol, 0.1 M sodium acetate pH 4.5 and 26% PEG550 with a protein con- 
centration of 15mgml ’. 

Data collection, integration and structure determination. For data collection, 
all crystals were flash-cooled in liquid nitrogen after a brief soaking in reservoir 
solution with the addition of 20% (v/v) glycerol. The native RBD data set was 
collected at the High Energy Accelerator Research Organization (KEK) BL1A 
(wavelength, 1.03818 A), whereas the diffraction data for the Au derivative crystal 
(wavelength, 1.0382 A) and the complex crystal (wavelength, 0.97930 A) were 
collected at the Shanghai Synchrotron Radiation Facility (SSRF) BL17U. All data 
were processed with HKL2000 (ref. 30). Additional processing was performed 
with programs from the CCP4 suite*’. 

The structure of RBD was determined by the single-wavelength anomalous 
diffraction (SAD) method. The Au sites were first located by SHELXD” for the 
Au-SAD data. The identified position were then refined and the phases were 
calculated with SAD experimental phasing module of Phaser*’. The real space 
constraints were further applied to the electron density map in DM™. The initial 
model was built with Autobuild in Phenix package*’. Additional missing residues 
were added manually in Coot*®. The final model was refined with phenix.refine in 
the Phenix” with energy minimization, isotropic ADP refinement, and bulk solv- 
ent modelling. The complex structure was solved by molecular replacement mod- 
ule of Phaser*’, with the solved RBD structure and previously reported CD26 
structure (PDB code, 2BGR) as the search models. The atomic model was com- 
pleted with Coot*® and refined with phenix.refine*’. The stereochemical qualities 
of the final models were assessed with PROCHECK”’’. The Ramachandran plot 
distributions for the residues in the free RBD structure were 86.8, 11.8 and 1.4% 
for the most favoured, additionally and generously allowed regions, respectively. 
These values were 86.5, 13.1 and 0.5% for the RBD-CD26 complex structure. Data 
collection and refinement statistics are summarized in Supplementary Table 1. All 
structural figures were generated using Pymol (http://www.pymol.org). 
Secondary-structure determination. The secondary structure determination was 
based on the ESPript** algorithm. 
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